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with r a rational function and fi a complex-valued Dini-continuous function 
on a real segment [a, h] C ( — 1, 1) which does not vanish, and whose argument 
is of bounded variation. Here ujya,b] stands the normalized arcsine distribution 
on [a, b]. 
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1. Introduction 

Best rational approximation of given degree to a holoniorphic function, in the least 
squares sense on the boundary of a disk included in the domain of analyticity, is 
a classical issue for which early references are [22, 30, 26, 20, 17]. The interplay 
between complex and Fourier analysis induced by the circular symmetry confers to 
such an approximation a natural character, and the corresponding approximants 
provide one with nice examples of locally convergent sequences of diagonal multi- 
point Fade intcrpolants. The problem can be recast as best rational approximation 
of given degree in the Hardy space of the unit disk and also, upon reflecting 
the functions involved across the unit circle, in the Hardy space of the complement 
of the disk which is the framework we shall really work with. 

Because of the natural isometry between Hardy spaces of the disk and the 
half-plane, that preserves rationality and the degree [25, Ch. 8], the question can 
equivalently be stated as best rational L^-approximation of given degree on the 
line to a function holomorphic in a half-plane. 



Abstract. For all n large enough, we show uniqueness of a critical point in 
best rational approximation of degree n, in the L^-sense on the unit circle, to 
functions of the form 
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Further motivation for this type of approximation stems from Control Theory 
and Signal Processing. Indeed, the transfer-function of a stable linear control sys- 
tem belongs to the Hardy space of the half-plane or of the complement of the disk, 
depending whether the setting is in continuous or discrete time, and it is rational 
if the system is finitc-dimcnsional. Moreover, by Parseval's theorem, the L"^ norm 
on the line or the circle of this transfer function coincides with the norm of the 
underlying convolution operator from L^[0,(X)) to L°°[0,oo) in the time domain 
[19]. Further, in a stochastic context, it coincides with the variance of the output 
when the input is white noise [24] . Therefore approximating the transfer function 
by a rational function of degree n, in of the line or the circle, is tantamount 
to identify the best system of order n to model the initial system with respect to 
the criteria just mentioned. Also, since any stationary regular stochastic process is 
the output of a linear control system fed with white noise [29, 18], this approxima- 
tion yields the best ARMA-process to model the initial process while minimizing 
the variance of the error. A thorough discussion of such connections with System 
Theory, as well as additional references, can be found in [2]. 

From the constructive viewpoint no algorithm is known to constructively 
solve the question we raised, and from a computational perspective this is a typical 
non-convex minimization problem whose numerical solution is often hindered by 
the occurence of local minima. It is therefore of major interest in practice to 
establish conditions on the function to be approximated that ensure uniqueness 
of a local minimum. This turns out to be difficult, like most uniqueness issues in 
nonlinear approximation. 

New ground for the subject was broken in [1] , where a differential-topological 
method was introduced to approach the uniqueness issue for critical points, i.e. 
stationary points of the approximation error. Uniqueness of a critical point implies 
uniqueness of a local minimum, but is a stronger property which is better suited to 
analysis. In fact, the above-mentioned method rests on the so-called index theorem 
[5] that provides us with a relation between the Morse indices of the critical points, 
thereby reducing the proof of uniqueness, which is a global property, to checking 
that each critical point has Morse index 0, which is a local issue. The latter is in 
turn equivalent to each critical point being a non-degenerate local minimiim. 

This approach was taken in [12] to handle the case where the approximated 
function is of Markov type, that is, the Cauchy transform of a positive measure 
on a real segment, when that measure is supported within some absolute bounds. 
Subsequently, in [8], the property of being a local minimum was connected to 
classical interpolation theory and the technique was applied to prove asymptotic 
uniqueness of a critical point in best rational approximation to e^/^/z on the 
unit circle, as well as in best rational approximation to generic holomorphic 
functions over small circles; here, asymptotic uniqueness means uniqueness in de- 
gree n for all sufficiently large n. The criterion derived in [8] for being a local 
minimum was further refined in [11], where it is shown that asymptotic unique- 
ness of a critical point holds for Markov functions whose defining measure satisfies 
the Szego condition. The result is sharp in that the Szego condition cannot be 
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omitted in general [10]. A general condition on the logarithmic derivative of the 
approximated function was derived [3] but it only ensures uniqueness in degree 1. 
A criterion for best approximation in degree n — 1 to a rational function of degree 
n can further be found in [9, Thm. 9.1], based on fast geometric decay of the error 
in lower degree. 

Altogether, these works indicate the fact, perhaps unexpected, that unique- 
ness of a critical point in best rational approximation is linked to a regular 
decrease of the error. 

The present paper can be viewed as a sequel to [11]. Indeed, the latter refer- 
ence expressed hope that the techniques set up there could be adapted to handle 
more general Cauchy integrals than Markov functions. Below, we take a step to- 
wards carrying out this program. Specifically, we consider Cauchy transforms of 
complex measures that are absolutely continuous with respect to the equilibrium 
distribution on a real segment [a,b] C (—1,1). The density will be required to be 
Dini-smooth and non- vanishing. In addition, it should admit an argument function 
of bounded variation on [a, b]. Moreover, we handle with little extra-pain the case 
where a rational function is added to such a Cauchy tranform. 

For functions of this kind, we establish an analog to [11, Thm. 1.3], namely 
asymptotic uniqueness of a critical point in best rational approximation for the L^- 
norm on the unit circle. This is the first uniqueness result in degree greater than 
1 for Cauchy integrals with complex densities, more generally for non-rational 
functions without conjugate syimnetry. In contrast, say, to [12, Thm. 3], it is only 
fair to say that such a statement is not really constructive in that no estimate is 
provided for the degree beyond which uniqueness prevails. However, considering 
our restricted knowledge on uniqueness in non-linear complex approximation, our 
result sheds considerable light on the behaviour of best rational approximants to 
Cauchy transforms, and it is our hope that suitable refinements of the technique 
will eventually produce effective bounds. 

Our method of proof follows the same pattern as [11]. Namely, the index 
theorem is invoked to reduce the question of uniqueness to whether each critical 
point is a non-degenerate local minimum. Next, a criterion for being a local mini- 
mum is set up, based on a comparison between the error function generated by the 
critical point under examination and the error function attached to a particular 
multipoint Fade interpolant of lower degree; we call it for short the comparison 
criterion. Finally, the fact this criterion applies when the degree is sufficiently large 
depends on strong asymptotic formulas for the error in rational interpolation to 
functions of the type we consider, that were recently obtained in [15, 13, 31], and 
on a specific design of interpolation nodes to build the particular multipoint Fade 
interpolant of lower degree that we need. 

With respect to [11], however, two main differences arise. The first is that 
the comparison criterion was set up there for conjugate-symmetric functions only, 
i.e. for those functions having real Fourier coefficients. Because we now adress 
Cauchy transforms of complex densities on a segment, we handle complex Fourier 
coefficients as well. Although the corresponding changes are mostly mechanical, 
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they have to be carried out thoroughly for they impinge on the computation of 
the Hessian quadratic form and on the nondegcncracy thereof. 

The second difference causes more serious difficulties. Indeed, the construc- 
tion of the special interpolant of lower degree needed to apply the comparison crite- 
rion requires rather precise control on the poles of multipoint Pade interpolants to 
the approximated function. For Markov functions, it is known that such poles are 
the zeros of certain orthogonal polynomials with respect to a positive measure on 
the segment [a,b], therefore they lie on that segment. But for Cauchy transforms 
of complex measures, the poles are the zeros of some non-Hermitian orthogonal 
polynomial with respect to a complex measure on [a,b], and their behaviour does 
not lend itself to analysis so easily. We resort here to the work in [4] on the ge- 
ometry of non-Hermitian orthogonal polynomials to overcome this difficulty, and 
this is where the boundedness of the variation of the density's argument becomes 
important. 

Let us briefly indicate some generalizations of our results that were not in- 
cluded here. Firstly, asymptotic uniqueness of a critical point extends to Cauchy 
transforms of absolutely continuous measures whose density with respect to 
Lehesgue (ratlica' than equilibrium) measure satisfies similar assumptions, e.g. non- 
vanishing and Holder-smoothness; in fact, densities with respect to any Jacobi 
weight could be handled the same way under suitable regularity requirements. 
We did not mention them, however, bccaiise such an extension depends on asymp- 
totics for non-Hermitian orthogonal polynomials with respect to this type of weight 
which are yet unpublished [14]. 

Secondly, finitely many zeros in the density would still be acceptable, provided 
they are of power type with sufficiently small exponent. Developing the precise 
estimates would make the paper heavier (compare [13, Thm. 4] and [31, Thm. 5]), 
so we felt better omitting this stronger version. 

The organization of the article is as follows. In Section 2 we present the ra- 
tional approximation problem under study and we state the main result of the 
paper, which is Theorem 1. Section 3 introduces the critical points in _ff ^-rational 
approximation and develops their interpolating properties; this part is adapted 
to complex Fourier coefficients from [8]. The index theorem and the comparison 
criterion are expounded in section 4, paralleling the treatment for real Fourier 
coefficients given in [11]. Section 5 recalls the necessary material on interpolation 
from [15, 13, 31], which is needed to carry out the comparison between critical 
points and interpolants of lower degree required in the comparison criterion. Fi- 
nally, elaborating on [11], we prove Theorem 1 in Section 6. 

2. Notation and Main Results 

Let T be the unit circle and D the unit disk. We let (resp. L°°) stand for the 
space of square-integrable (resp. essentially bounded) measurable functions on T. 
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Denote by i?^ the familiar Hardy space of the unit disk, consisting of those L^- 
functions whose Fourier coefRcients of strictly negative index are zero. The space 
identifies with traces on T of those holomorphic functions in D whose L^-means 
on circles centered at zero with radii less than 1 are uniformly bounded above. In 
fact, any such function has non-taiigcntial boundary values almost everywhere on 
T that define a member of from which the function can be recovered by means 
of a Cauchy integral [21]. The space of bounded holomorphic functions on D is 
denoted by and is endowed with the L°° norm of the trace. 

Let Hq be the orthogonal complement of H'^ in with respect to the stan- 
dard scalar product 

(/,fl):= //(^)^^' /'^e^'- (2-1) 

The space Hq, in turn, identifies with traces of those holomorphic functions in 
C \ B) that vanish at infinity and whose L^-means on circles centered at zero with 
radii greater then 1 are uniformly bounded above. In what follows, we denote by 
II • II2 the norm on L^, , and induced by the scalar product (2.1). On one 
occasion, we shall refer to the Hardy space Hq{C \ Bp), which is defined similarly 
except that D gets replaced by Dp := {\z\ < p} where p > 0. 

Set Vn for the space of algebraic polynomials of degree at most n and A4n for 
the subset of monic polynomials with exactly n zeros in D. Note that q <E Ain if and 
only if 1/g e Hq and l/q{z) ~ z~" at infinity. From the differential viewpoint, we 
regard Vn as C""*"^ and A4n as an open subset of C", upon taking the coefficients 
as coordinates (except for the leading coefficient in A4„ which is fixed to unity). 

Define 

It is easy to check that TZn consists of those rational functions of degree at most 
n that belong to Hq, and we endow it with the corresponding topology. Coordi- 
natizing Vn-i and Al„ as above, it is straightforward to see that the canonical 
surjection J : Vn-i x A^n T^n is smooth (i.e., infinitely differentiable) when 
viewed as a _ffQ-vahied map. Note that J is not injcctive, due to possible cancel- 
lation between p and g, but it is a local homeomorphism at every pair (p, q) such 
that p, q are coprime. 

We shall be concerned with the following problem. 

Problem 1: Given f E Hq and n £ N, find r G 7^„ to minimize \\f — r\\2- 

Let us point out two equivalent formulations of Problem 1 that account for 
early discussion made in the introduction. 

Firstly, it is redundant to assume that r lies in Hq, as is subsumed in the 
definition of TZn- Indeed, by partial fraction expansion, a rational function of degree 
at most n in can be written as ri +r2, where ri e Hq and r2 € H"^ have degree 
at most n. By orthogonality of Hq and H'^, we get ||/ — r||2 = ||/ — ri||2 + ||''2||2 
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so that ri is a better candidate approximant than r, showing that Problem 1 is in 
fact equivalent to best rational approximation of given degree to / in L^. 

Secondly, composing with z i— » 1/z, which is an L^-isometry mapping Hq 
onto zH'^ while preserving rationality and the degree, Problem 1 transforms to best 
approximation in of functions vanishing at the origin by rational hmctions of 
degree at most n that vanish at the origin as well. However, by Parseval's identity, 
any best rational approximant to g in has value ^(O) at 0. Therefore Problem 
1 is equivalent to 

Problem 2: Given g G H"^ and n e N, find a rational r of degree at most n in 
to minimize \\f — r\\2- 

Problem 1 is the one we shall work with, and we refer to it as the rational 
Hq -approximation problem to f in degree n. It is well-known (see [3, Prop. 3.1] 
for a proof and further bibliography on the subject) that the minimum is attained 
and that a minimizing r, called a best rational approximant of degree n to f, lies 
in TZn \ TZn-i unless / € TZn-i- Uniqueness of such an approximant is a delicate 
matter. Generically, there is only one best approximant by a theorem of Stechkin 
on Banach space approximation from approximately compact sets [16]. However, 
the proof is non-constructive and does not allow us to determine which functions 
have a unique best approximant and which functions do not. Moreover, from the 
computational viewpoint, the main interest lies not so much with uniqueness of a 
best approximant, but rather with uniqueness of a local best approximant for such 
places are all what a numerical search can usually spot. By definition, a local best 
approximant is a function r; e TZn such that ]]/ — r/jj2 < ]]/ — ?'ll2 for all r e TZn 
in some neighborhood of ri . Like best approximants, local best approximants lie 
in TZn \ TZn-i unless / e TZn-i [3]. 

Still more general is the notion of a critical point, which is defined as follows. 
Fix f € Hq and put 

^f,n ■■ TZn -> [0, OO) 

r - llZ-r-lli 

A pair {p,q) G T'n-i x is called critical if all partial derivatives of ^f^n ° J 
vanish at {p, q) . Subsequently, a rational function Tc G TZ,,, is said to be a critical 
point of ^f,n if there is a critical pair (pc, Qc) such that rc = Pc/Qc- Critical points 
fall into two classes: they are termed irreducible if they have exact degree n, and 
reducible if they have degree strictly less than n. Note that is irreducible if 
and only if Pc and Qc are coprime in some (hence any) representation = Pc/Qc- 
Clearly a local best approximant is a particular instance of a critical point, and it 
is irreducible unless / G T^n-i- 

In the present work, we dwell on a differential topological approach to unique- 
ness of a critical point introduced in [1] and further developed in [5, 2]. In this 
approach, global imiqucncss is deduced from local analysis of the map Specif- 
ically, to conclude there is only one critical point, which is therefore the unique 
local minimum (and a fortiori the global minimum as well), one needs to show that 
each critical point is irreducible, does not interpolate the approximated function 
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on T, and is a nondcgcnerate local minimum; here nondegenerate means that the 
second derivative is a nonsingular quadratic form. This method turns out to be 
fruitful when studying rational approximation to Cauchy transforms of measures 
supported in (—1, 1), i.e., functions of the form 

supp(m)c(-1,1). (2.4) 

The first result in this direction was obtained in [12, Thm. 3] when is a Markov 
function, meaning that /x in (2.4) is a positive measure. It goes as follows. 

Theorem A. Let ^ he a positive measure supported on [a,b] C (—1,1) where a 
and b satisfy b — a < (l — max {a^, 6^}) . Assume further that fj, has at least n 
points of increase, i.e., ^ Ti-n-i- Then there is a unique critical point in rational 
Hq -approximation of degree n to f^. 

Removing the restriction on the size of the support makes the situation more 
difficult. The following theorem [11, Thm. 1.3] asserts that rational approximants 
are asymptotically unique for Markov functions whose defining measure is suffi- 
ciently smooth. Hereafter, we denote by uj[a,b\ the normalized arcsine distribution 
on [a,b] given by diO[a,b]{t) = dt/{■JT^/{t - a){b - t)). 

Theorem B. Let ji be a positive measure supported on [a,b] C (—1,1) and let us 
write dii — ii'dt + d/is, where jig is singular and ji' is integrable on [a,b\. If ji 
satisfies the Szego condition: J logi2'doj^a,b] > — oo, then there is a unique critical 
point in rational Hq -approximation of degree n to f^ for all n large enough. 

As an additional piece of information, the following negative result [10, Thm. 
5] shows that the asymptotic nature of the previous theorem is indispensable. 

Theorem C. For each uq & N there exists a positive measure n satisfying the 
Szego condition such that for each odd n between 1 and no there exist at least two 
different best rational approximants of degree n to f^. 

Our goal is to extend Theorem B to a class of complex measures which is 
made precise in the definition below. Recall that a function h with modulus of 
continuity iVh is said to be Dini- continuous if iOh{t)/t is integrable on [0, e] for 
some (hence any) e > 0. 

Definition (Class M). A measure fx is said to belong to the class M i/supp(/i) C 
(—1,1) is an interval, say [a,b], and dji = jjLduj[a,b]j where fi is a Dini- continuous 
non-vanishing function with an argument of bounded variation on [a, b]. 

Observe that we deal here with fi, the Radon-Nikodym derivative of /U with 

respect to the arcsine distribution, rather then with /i', the Radon-Nikodym de- 
rivative with respect to the Lebesgue measure. Our main result is: 

Theorem 1. Let f '■= f/j, + r, where G M and r € TZm has no poles on supp(/Lt). 
Then there is a unique critical point in rational Hq- approximation of degree n to 
f for all n large enough. 
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Before we can prove the theorem, we must study in greater detail the struc- 
ture of critical points, which is the object of Sections 3 and 4 to come. 



3. Critical Points 

The following theory was developed in [5, 6, 7, 8] when the function / to be approx- 
imated is conjugate-symmetric, i.e., f{z) — ,f{z), and the rational approximants 
are seeked to be conjugate-symmetric as well. In other words, when a function with 
real Fourier- Taylor expansion at infinity gets approximated by a rational function 
with real coefficients. Surprisingly enough, this is not subsumed in Problem 1 
in that conjugate-symmetric functions need not have a best approximant out of 
TZn which is conjugate-symmetric. For Markov functions, though, it is indeed the 
case [4]. Below, we develop an analogous theory for Problem 1, that is, without 
conjugate-symmetry assumptions. This involves only technical modifications of a 
rather mechanical nature. 

Hereafter, for any / G L^, we set /"^(z) := {l/z)f{l/z). Clearly, / 
is an isometric involution mapping onto Hq and vice-versa. Further, for any 
p G Vk, 'we set p{z) := p{l/z) and define its reciprocal polynomial (in Vk) to be 
p{z) := z^p{z) = z^p{l/z). Note that p has the same modulus as p on T and its 
zeros arc reflected from those of p across T. 

3.1. Critical Points as Orthogonal Projections 

Fix / e Hq and let := be given by (2.3). It will be convenient to use 
complex partial derivatives with respect to pj, qk,Pj, Qk, where, for < j,k < n— 1, 
the symbols pk and qk refer to the coefficients of p G Vn-i and q G A^„ as in 
equation (2.2). By complex derivatives we mean the standard Wirtinger operators, 
e.g., if pj = Xj + iyj is the decomposition into real and imaginary part then 
d/dpj = (d/dxj — id/dyj)/2 and d/dpj = (d/dxj + id/dyj)/2. The standard rules 
for derivation are still valid, obviously dg(pj)/dpj = if <? is holomorphic, and it 
is straightforward that dh/dpj = dh/dpj for any function h. In particular, since 
is real, 

7^ ^=7?^. (3.1) 



dpj \ dpj J dqk V 9qk 

Thus, writing (E>„o J(p, q) = {f—p/q, f—p/q) and differentiating under the integral 
sign, we obtain that a critical pair {pc, Qc) of $„o J is characterized by the relations 

^"^"(Pcgc) = /-,./-^\=0, jG{0,...,n-l}, (3.2) 



op'j \ qc q, 

^"^"(Pc^c) = -/^,/-^\=0, fce{0,...,n-l}. (3.3) 



dqk \ qi qc 

Equation (3.2) means that Pc/qc is the orthogonal projection of / onto Vg^, where 
for any q G A4n we let Vg := {p/q : p G Vn-i} to be the n-dimensional linear 
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2 • (3.6) 



subspace of Hq consisting of rational functions with denominator q. In what fol- 
lows, we consistently denote the orthogonal projection of / onto Vq by Lg/q, where 
Lq e Pn-i is uniquely characterized by the fact that 

/-— ,-\=0 for any peVn-i- (3.4) 

q q/ 

Taking equation (3.3) into account, we conclude from what precedes that critical 
points of are precisely 7^„-functions of the form Lq^/qc, where qc G M.n satisfies 

'^-^,/-^\=0, fce{0,...,n-l}. (3.5) 

qc qc I 

Now, it is appearent from (3.4) that Lq is a smooth function of q, therefore we 
define a smooth map on M.n by setting 

*n = */,n:A^n ^ [0, oo) 

q ^ \\!-^qlq\\i 

By construction, <&„ attains a local niiniimim at r = Lq^ / qi if and only if attains 
a local minimum at qi, and the assumed values arc the same. More generally, 
r G TZn is a critical point of $„ if and only if r = Lq^/qc, where qc € Mn, is a 
critical point of This is readily checked upon comparing (3.5) with the result 
of the following computation: 

d^n ^ / {dLq/dqk)q-z^Lq ^ Jf ^1 {dLq/dqk) 

dqk \ q^ ' q / \ q ' q 

'-^J-^), fce{0,...,n-l}, (3.7) 

q^ q I 

where we applied (3.4) using that the derivatives of Lq lie in Vn-\ and that 
dq/dqk = 0. For simplicity, we drop from now on the subscript "c" we used so 
far as a mnemonic for "critical". Altogether we proved the following result: 

Proposition 1. For f e Hq, let $n and \l/n be defined by (2.3) and (3.6), respec- 
tively. Then r € TZn is a critical point of if and only if r = Lq/q and q G A4n 
is a critical point of . 

In view of Proposition 1 , we shall extend to the terminology introduced 
for and say that a critical point q G ^An of is irreducible if Lq and q are 
coprime. 

3.2. Interpolation Properties of Critical Points 

If we denote with a superscript "_L" the orthogonal complement in Hq, it is ele- 
mentary to check that 

Vq^ = i^^u: u&H^Y ill = V<i® (3-8) 

Hence, by (3.4), there exists Uq = Uf^q G Hq such that 

fq-Lq = quq. (3.9) 
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Relation (3.9) means that Lq/q interpolates / at the reflections of the zeros of q 
across T. Assume now that q G A4n is a critical point of Then, combining 
(3.9) and (3.7), we derive that 



p{r) {Lqu-){T) dT 
q{T) qir) 27ri ' 



(3.10) 

Since LqU^/q e H^, we see by letting p range over elementary divisors of q and 
applying the residue formula that (3.10) holds if and only if each zero of gr is a 
zero of Lqu'^ of the same multiplicity or higher. That is, g is a critical point of 
if and only if q divides LqU^ in H^. 

Let d e Mk be the monic g.c.d. of Lq and q, with < < n — 1. Writing 
Lq = dp* and q = dq*, where p* and q* are coprime, we deduce that q* divides Uq 
in or cquivalcntly that Uq = q*h for some h G Hq. Besides, it follows from (3.4), 
applied with p = dv and v € Vn-k-i, that p* = Lq*. Therefore, upon dividing 
(3.9) by d, we get 

fq*-Lq. = ^h, (3.11) 

implying that Uq* = dq*h/d. In particular, q, thus a fortiori q* , divides Uq, in H^, 
whence q* is critical for ^'„_fc by what we said before. Finally, dividing (3.11) by 
q* and taking into account the definition of the reciprocal polynomial, we find that 

wc established the following result. 

Proposition 2. Let q he a critical point of \E'„ and d € Aik be the monic g.c.d. of 
Lq and q with < k < n — 1. Then q* = q/d G Mn-k is an irreducible critical 
point of^n-k, and Lq* /q* interpolates f at the zeros of q* d/z in Hermite's sense 
on C \ D, that is, counting multiplicities including at infinity. 

The converse is equally easy: if q* is an irreducible critical point of ^'n-fe, and 
Lq* /q* interpolates / at the zeros of q* d/z in Hermite's sense for some d € Mu, 
then q = q*d is a critical point of <^„ and d is the monic g.c.d. of q and Lq. This 
we shall not need. 

It is immediately seen from Proposition 2 that a critical point of $ must 
interpolate / with order 2 at the reflections of its poles across T; for best approx- 
imants, this property is classical [22, 26]. 

3.3. Smooth Extension of ^„ 

One of the advantages of "Pn, as compared to $„, is that its domain of definition can 
be compactified, which is essential to rely on methods from differential topology. 
To do that, however, we need to place an additional requirement on /. 

Let us denote by Hq the subset of comprised of functions that extend 
holomorphically across T. Hereafter we will suppose that f G Hq and pick p = 
p{f) < 1 such that / is holomorphic in > p — e} for some e > 0. In particular, 
/ is holomorphic across Tp := {\z\ = p}. 
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Denote by Mn and mI/'' respectively the closure of Mn and the set of 
monic polynomials with zeros in Di/p := {l^l < as usual, wc regard these 

as subsets of C" when coordinatized by their coefficients except the leading one. 
This way mI/'^ becomes an open neighborhood of the compact set Al„, which is 
easily seen to consist of polynomials with zeros in D. Also, q lies on the boundary 
dMn = -Mn \ -Mn if and only if it is a monic polynomial of degree n having at 
least one zero of modulus 1 and no zero of modulus strictly greater then 1. 

For q e A4n, since q/q is unimodular on T, it follows from (3.9) that 



*n(«) 



T 2 



2 



liuqKKr)^.. (3.12) 



In addition, taking into account the Cauchy formula, the analyticity of Lq/q in D, 
the analyticity of / across Tp, and the definition of the cr-operation, we obtain 

= = J-l^fi^J^^, H>,, (3.13) 

L,iz)-Laz) - J^^^ ^, \z\>p. (3.15) 

Now, if g e M.l/'', then g has all its zeros of modulus greater then p, therefore 
(3.15) and (3.13) are well-defined and smooth with respect to the coefficients of q, 
with values in Vn-i and Hq{C \ Dp) respectively. Because evaluation at r e T is 
uniformly bounded with respect to r on //q(C\Dp), in turn extends smoothly 
to Ml/" in view of (3.12). Moreover, differentiating under the integral sign, we 
obtain 



dua , , 1 /' fir) dr , , /„ 

dqj 2m J^^ q{T} z-t 

_ If -T^r{T)q{T) dT 



with 



(z) = TT- \ ' , 1^1 < Vp, (3-18) 

dqj ^t^^Jt,,, 1 (t) t-z 

for j = 0, . . . , n — 1. To recap, we have proved: 

Proposition 3. Let f G -ffo- Then extends to a smooth function in some neigh- 
borhood of Ain and so do Lg and Uq with values in Vn-i and Hq{C \ Dp) respec- 
tively. In addition, (3.16), (3.17), and (3.18) hold. 

We shall continue to denote the extension whose existence is asserted in 
Proposition 3 by ^'/,n5 or simply if / is understood from the context. 
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3.4. Critical points on the boundary 

Having characterized the critical points of on A^„ in Section 3.2, we need now 
to describe the critical points that it may have on dA4n- We shall begin with the 
case where all the roots of the latter lie on T. 

Let f G Ho and assume v{z) = {z — ^)'^, ^ e T, is a critical point of ^'/j. It 
immediately follows from (3.13), the Cauchy formula, and the definition of v that 

v = e'%, where e'^ := (-0'', = 0, and u^, = e-'^f. (3.19) 
In this case, equations (3.17) and (3.18) become 



dqj 2m Jj^ v{t) - - 



Plugging these expressions into (3.16), we obtain 

where := {p < \z\ < 1/p} with positively oriented boundary dAp, and we 
used the Fubini-Tonelli theorem. By taking linear combinations of the previous 
equations, we deduce from the Cauchy formula that 

. f (/r)(T) dr (/r)('-^)(e) , 

Hence v divides /.f^, when viewed as a holomorphic function in Ap. Consequently, 
since ^ S T is a zero of / if and only if it is a zero , we get that / vanishes at ^ 
with multiplicity [{k + 1)/2J, where [x] is the integer part of x. 
Next we consider the case where q is a critical point of having exactly one root 
on T: g = vq* with v{z) = {z - O*', ^ G T, and q* G Mn-k- Denote by Q and V 
some neighborhoods of q* and v, in Ml/fi^ and M].^'' respectively, taking them so 
small that each x S Q is coprimc to each ly Cz V; this is possible since q* and v are 
coprimc. Then, (x, i^) i—t is a diffeomorphism from Q x V onto a neighborhood 
of q in Mn'' . In particular, the fact that g is a critical point of means that q* 
is a critical point of 6 and v a critical point of S, where 

e:Q ^ [0,(X3) S:V ^ [0, ex.) 

Since ux = e*^ux, where e*^ is as in (3.19), it follows from (3.13) that Uy^ = e^'^^u^, 
and therefore by (3.12) that O = '^n-k\Qj implying that q* is a critical point of 
^n-k- In another connection, shrinking V if necessary, we may assume there exists 
g> p such that V C -M.]/^. Put for simplicity w := Uq- = which is clearly an 
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element of Hq by (3.13). Computing with the latter formula yields for any u G V 
that 



where we used the Fubini-Tonelli theorem and the Cauchy integral formula. Thus, 
we derive from (3.12) that 

As w is a critical point of S, we see that it is also critical for ^w,k, so by the case 
previously considered we conclude that w = Uf^q* vanishes at ^ with multiplicity 
[(fc+ 1)/2J . By (3.9), this is equivalent to the fact that Lq*/q* = ig/f/ interpolates 
/ at the zeros of d{z) = [z ~ ^)L(fe+i)/2J Hcrmitc's sense. 

Finally, the case where q is arbitrarily located on dMn is handled the same 
way upon writing q = q*vi . . .ve, where Vj{z) = {z — ^j)'^^ for some G T, and 
introducing a product neighborhood Q x Vi x . . . x of q*vi . . . to proceed 
with the above analysis on each of the corresponding maps O, Si, . . . Thus, 
taking into account Proposition 2 and the fact that Vj and Vj have the same zeros 
in C, we obtain: 

Proposition 4. Let f E Hq and q — vq* , where v = Y[i^~^j)''^ ' '= dcg(w) — k, 
and q* € Ain-k- Assume that q is a critical point of \E'„ = ^'/,n- Then q* is a 
critical point of ^„_fe. Moreover, if we write q* = qidi where di is the monic 
g.c.d. of Lq> and q* , then Lq^/q* = Lq/q interpolates f at the zeros of qi^did/z 
in Hermite's sense on C \ D, where d{z) = Y\{z — ^j) L(fej+i)/2j ^ 

Again the converse of Proposition 4 is true, namely the properties of q* and 
V asserted there imply that q = q*v is critical for This is easy to check by 
reversing the previous arguments, but we shall not use it. 

4. A Criterion for Local Minima 

Let / G H{) and = ^' bo the extended map obtained in Proposition 3, 
based on (3.12) and (3.13). The latter is a smooth real-valued function, defined 
on an open neighborhood of Mn identified with a subset of C" ~ M^" by taking 
as coordinates all coefficients but the leading one. By definition, a critical point 
of is a member of M.n at which the gradient vanishes. This notion is 

of course independent of which coordinates are used, and so is the signature of 
the second derivative, the so called Hessian quadratic form^. A critical point q is 
called nondegenerate if the Hessian form is nonsingular at q, and then the number 



'^This is not true at non-critical points. 



{wu){t) dr _ 1 r If {fq*)it) dt i/(r) dr 
v{t) z-t 27ri 27ri Jj^ q* {t) T-t ?(t) z-t 
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of negative eigenvalues of this form is called the Morse index of q denoted by 
M{q). Observe that nondegenerate critical points are necessarily isolated. 

From first principles of differential topology [23] it is known that (— l)''^^^^ 
which is called the index of the nondegenerate critical point q, is equal to the so- 
called Brouwer degree of the vector field V5'„/|| V^'„||e on any sufficiently small 
sphere centered at q, where || • ||o is the Euclidean norm in R^". 

One can show that dMn is a compact manifold^, so if has no critical 
points on dA4n and only nondegenerate critical points in M.n, then the sum of 
the indices of the critical points is equal to the Brouwer degree of V^'„/|| V4'„||e 
on dMn- The surprising fact is that the latter is independent of f (see [1], [5, Sec. 
5], and [2, Thm. 2]) and is actually equal to 1. Altogether, the following analogue 
of the Poincare-Hopf theorem holds in the present setting. 

The Index Theorem. Let f S Hq and '^f,n be the set of the critical points of ^' /.„ in 
M-n- Assume that all members of^f^n are nondegenerate, and that '^f^nf^dMn = 
0. Then 

(_l)M(g) = 1. 

To us, the value of the index theorem is that if can show every critical point 
is a nondegenerate local minimum and none of them lies on dMn, then the critical 
point is unique. To see this, observe that local minima have Morse index and 
therefore index 1. 

To make this criterion effective, we need now to analyze the Morse index of 
a critical point, starting with the computation of the Hessian quadratic form. 

Let g be a critical point of It is easy to check that the Hessian quadratic 
form of at q is given by 

(n— 1 n— 1 / ^2 T o2 t \ \ 

gE(»,-aj;a^W + «.«^5^W))J. (") 

where we have set v{z) ^ Sj=o^ ^i-^'^ ^ generic element of ■p„_i, the latter being 
naturally identified with the tangent space to 7V{„ at q, and we continue to consider 
Qjt Qj, j € {0, 1}, as coordinates on A^„. Clearly, g' is a nondegenerate local 

minimum if and only if =S is positive definite, i.e., 

Q{v) > for w e Vn-i, v=^0. (4.2) 

Let us assume that q is irreducible, hence q G Mn by Proposition 4. To derive con- 
ditions that ensure the validity of (4.2), we commence by reworking the expression 
for =2. 



^We skim through technical difficulties here, because this manifold is not smooth; the interested 
reader should consult the references we give. 
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Any polynomial in V2n-i can be written piLq + p2q for suitable pi,P2 G 
M.n-1, due to the coprimeness of Lq and q. Therefore 

4./-— ^=0 for any p e p2„-i (4.3) 
q I 

by (3.4) and (3.7). In view of (4.3), differentiating (3.4) with respect to qj and 
evaluating at q leads us to 

which means that d{Lq/ q) / dqj belongs to . Hence, we get from (3.8) that 

As Lq and q are coprime, the polynomials Vj are linearly independent by construc- 
tion, thus we establish a one-to-one linear correspondence on Vn-i by setting 

n— 1 n— 1 

^(^) = X) ^3^^ ^ ^(^) = - m Vjiyjiz). (4.6) 

Moreover, from Proposition 2 where = 1 and q* = q, we can write {compare 
(3.11)) 

f-^ = Mu;^ for some e H^. (4.7) 

q q ^ 

Note that G .ffo since / does, hence Wq is holomorphic across T. Now, it follows 
from (3.7) and (3.1) that 



dqkdqj \ dqkdqj \ q J ' q / \dqj \ q J ' dqk \ q 

by (4.3), (4.5), and the fact that q/q is unimodular on T. Furthermore 
dqkdqj \oqkdqj \ q J q / \dqj \ q ' 



-Wc 



52 

\ dqkdqj \ q J ' q 
by (4.4) and (4.7). Now, a simple computation using (4.5) yields 

fLq \ _ qjd'^Lq/ dqkdqj) - z''{dLq/dqj) + z^{dLq/dqk) _ „ jqi^k 

2 „3 ' 



dqkdqj \ q J 
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and since the first fraction on tlie above riglit-liand side belongs to V2n-i/q, we 
deduce from (4.4) and what precedes that 



z'qvk qq a \ o 



(4.9) 



dqkdqj \ r 9 V \ q 

since q/q is unimodular while q = q on T. So, we get from (4.8), (4.9), and (4.6) 
that 

n— 1 n— 1 ^9 -r / \ / 

=-2(— '<) ='2(-,(^'^«,y 

j=0 fc=0 

and 



n— 1 n—\ 



(a) 



q q 



j=0 k=0 "I'^-'iJ 

Therefore, in view of (4.1) the quadratic form =S/2 can be rewritten as 

2 



1 



2Re( -AvwnY 

.q 



V 








q 







(4.10) 



To manage the above expression, we assume that Lq/q does not interpolate 
/ on T, i.e. that Wq has no zeros there, and we let Q G Mi have the same zeros 
as Wq in B), counting multiplicities. Thus we can write Wq = OqQ/Q, where Oq is 
holomorphic and zero-free on a neighborhood of ^ 



while \Oq\ = \Wq 



on T since 



Q/Q is unimodular there^. Consider now the Hankel operator F, with symbol 
Sq := Lq/{oqqq), i.e. 

U 1-^ P_ {SqU) , 

where P_ is the orthogonal projection from L2 onto H^. Observe that F is well 

defined because Sq is bounded on T, and since the latter is meromorphic in D 
with poles at the zeros of q, counting multiplicities. It is elementary [27] that 
F(7?^) = Vq, that KerF = {q/q)H'^, and that T : Vq ^ Vq \s an isomorphism, 
where Vq := Vn-i/q is readily seen to be the orthogonal complement of KerF in 
. Thus, there exists an operator F# : Vq Vq, which is inverse to r|y_. To 
evaluate F"^, observe from (4.5) that 

n— 1 / -i T \ n— 1 



T{vOq) 



vLq 

qq 



i=o 



qq 



3=0 



q{dLq/dqj) - qvj 



qq 



= E 

J=0 



-qvj_ 

qq 



q 



where we used that {dLq/dqj)/q G H"^ and that v/qG Hq. Hence we may write 



F#(^ 



with uG-iH^ = Kevr, 

q 



(4.11) 



*The function Oq is none but the outer factor of Wq in H'^, see [21, Thm. 2.8]. 
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and since Wq/oq G H°° it follows that uwq/oq G KerF as well, entailing by (4.10) 
and (4.11) that 



because {y/qY GVq = (Kerr)-"-. Altogether, we see that 





2 








V 


- 2 








q 


2 









> (i-2||r#| 



by the Schwarz incquaUty and since \wqloq\ = 1 on T while the a operation 
preserves the norm. The inequalities above imply that J2 is positive definite as 
soon as HF'^II < 1/2. This last inequaUty is equivalent to saying that 2 is strictly 
less than the smallest singular value of T\^, , which is also the n-th singular value 
of r since Vq has dimension n and is the orthogonal complement of Ker T in . 
By the Adamjan-Arov-Krein theorem [27], the singular value in question is equal 
to the error in L°°-best approximation to Sq from H^_i, where -ff^i stands for 
the set of functions of the form h/x where h G and x G -^n-i- Let us indicate 
this approximation number by dn-i- 

C7„_i := inf \\sq - g\\^^ . 

As Sq is holomorphic on a neighborhood of D, it follows from the Adamjan-Arov- 
Krein theory that the infimum is uniquely attained at some Qn-i S which 
is holomorphic on a neighborhood of T, that \sq — gn-i\iO = ""n-i for all ^ G T, 
and that wt (Sg — Qn-i) < — 2n + 1 as soon as ct„_i > 0, where wt stands for the 
usual winding number of a non-vanishing continuous function on T. 

We will appeal to a de la Vallee-Poussin principle for this type of approxi- 
mation, to the effect that 

I, (4.12) 



_i > inf \sq 



whenever g G H^_i is such that 

Wt {sq - .g) < 1 - 2n. 

This principle is easily deduced from the Rouche theorem, for if (4.12) did not 
hold then the inequality 



\{9n-l -9)-{Sq-9)\ = \9n-l 



< So 



would imply that WT(5'n-i — 9) = WT(Sq — g) <1 — 2n, which is impossible unless 
gn-i = g because gn-i — g is meromorphic with at most 2n — 2 poles in D. 

Hence, with our assumptions, that q is an irreducible critical point and that 
/ — Lq/q has no zero on T, we find that =S will be positive definite if there exists 
Ilg G 1Zn-i such that 



2\f-Lq/q\<\Ilq-Lq/q\ 



on 



T and wt(/ - H,) < 1 - 2n. 



(4.13) 
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Indeed, in this case, we will get 
2 < 







Lq n. 




s 


mwqh 




Oqqq qog 




i>q ~ 

QOq 



on T, 



(4.14) 



because \q/q\ 



1 and Iw^l 



on T. Moreover, it follows from (4.13) 



and the triangle inequality that \ f — Lq/q\ < jllq — /|, and therefore 



so that wj{Lq/q — Uq) 

QOq, 



\iLq/q-nq)-{f-Ilq)\<\f-Ilq\ 

wt(/ — Ilq) by Rouche's theorem. Consequently 



'9 ^ 



Wt 



Lq/q-U, 
Oqq 



Wt {Lq/q - Uq) = wt(/ - Uq), 



where wc used that o^g docs not vanish on D. So wc sec that Uq/qOq can be used 
as g in (4.12) to bound an-i from below by a quantity which, by (4.14), is strictly 
bigger than 2. Altogether, rewriting (4.13) in the equivalent form (4.15) below, we 
proved: 

Theorem 2 (Comparison Criterion). Let f <E Hq and q £ A^„ be an irreducible 
critical point of ^'/,n such that f — Lq/q does not vanish on T. Then q is a non- 
degenerate local minimum as soon as there exists Uq e 'R-n-i satisfying 

/ - Lq/q 

In order to use this criterion, we need an appraisal of the error in interpolation 
to / by members of TZn and TZn-i- In the next section, we gather the necessary 
estimates for the class of functions introduced in Theorem 1 after the works [13, 31]. 



2 < 



1 - 



on 



T and wt(/ — 11^) = 1 — 2n. 



(4.15) 



5. Error in Rational Interpolation 

Let us recall the notion of a diagonal multipoint Pade approximant. Conceptually, 
a diagonal (multipoint) Pade approximant to a function g holomorphic in a domain 
f2 C C, is a rational function of type (n, n)^ that interpolates g m & prescribed 
system of 2n + 1 points of f2, counting multiplicities. However, such a definition 
may not work and it is best to adopt a linearized one as follows. Without loss 
of generality, we normalize one interpolation point to be infinity and assume that 
g{oo) = 0. The remaining 2n interpolation points, finite or infinite, form a set l2n 
accounting for multiplicities with repetition. Let Q2n be a polynomial vanishing 
exactly at the finite points of l2n- Then: 

Definition (Pade Approximant). The n-th diagonal (multipoint) Pade approximant 
to g associated, with is the rational function n„ = Pn/^n satisfying: 

• deg pn < n, degin < n, and £n ^ 0; 

• {£n{z)g{z) -Pniz)) /Q2n{z) is analytic in Q; 



*A rational function is said to be of type (rii, 112) if it can be written as the ratio of a polynomial 
of degree at most ni by a polynomial of degree at most n2- 
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• (^„(z)g(z)-p„(z))/Q2„(^) = 0(l/2"+') as 2 ^00. 

The conditions for pn and in amount to solving a system of 2n + 1 homoge- 
neous hncar equations with 2n + 2 unknown coefficients, and clearly no solution 
can be such that in = 0. Moreover it is plain to sec that all pairs {pn,in) define 
the same rational function pn/in, thus a Padc approximant indeed exists uniquely 
with the above definition. As a result of our normalization, note that the third 
condition in the above definition entails at least one interpolation condition at 
infinity, therefore n„ is in fact of type (n — 1, n). 

For our present purpose, we shall be interested only in the case where g = f 
is as in Theorem 1, and X2n consists of n points, each of which appears with 
multiplicity 2. In other words, we let _E„ consist of n points, repeated according to 
their multiplicities, in the analyticity domain of f, say, ~ <C \ {[a,b]U A) where 
[a, b] := supp(/i) and A is the set of poles of r. We further let w„ be the monic 
polynomial whose roots are the finite points of En, and we obtain n„ from the 
previous definition where g = f and Q2n = v^- 

Next, we let n range over N and we put = {£"„} for the interpolation 
scheme, i.e. the sequence of sets of interpolation points. By definition, the support of 
S is supp(<^) := n„£NUfe>„i^fc. We also introduce the probability counting measure 
of En to be the measure with mass 1/n at each point of En, repeating according 
to multiplicities. 

We shall need strong asymptotics on the behaviour of n„ as n ^ oo. To 
describe them, we need some more notation. Let us denote by 

w{z) := \/{z — a){z — b), w{z)/z — > 1, 

the holomorphic branch of the square root outside of [a,b] which is positive on 

{b, +oo), and by 

the conformal map of D := C \ [a, b] into D such that 0(oo) = and ^'(oo) > 0. 
Note that (j) is conjugate-symmetric. 

Recall that the logarithmic energy of a positive Borel measure a, compactly 
supported in C, is given by — / /log \z — t\da{z)da{t), which is a real number or 

+00. 

Definition (Admissibility). An interpolation scheme S is called admissible if the 
sums X^eG-E l'^(^) ~ '^''6 uniformly bounded with n, supp(<^') C Dj, and the 

probability counting measure of En converges weak* to some Borel measure a with 
finite logarithmic energi^. 



®Note that a may not be compactly supported. In this case, pick zo € C \ supp(<?) such that 
20 ^ supp(cr) and set Mzg (z) := 1/{z—zq). Then, all the sets Mzg {En) are contained in a common 
compact set and their counting measures converge weak* to u' such that c' (B) := a{M^^ (B)) 
for any Borel set B <zC What we require is then the finiteness of the logarithmic energy of <t'. 
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The weak* convergence in the above definition is understood upon regarding 
complex measures on C as the dual space of continuous functions with compact 
support. 

To an admissible scheme we associate a sequence of functions on by 
putting 

ii„W = i^(^;.):=n^ia;p||, .^D. (5.2) 

Each i?„ is holomorphic in D, has continuous boundary values from both sides of 
[a, 6], and vanishes only at points of En. Note from the conjugate-symmetry of 
that 

{z) - 0(e) _ 0(z) - 0(e) ^ (/.(e) - (/.(e) 



1 - </.(^)</.(e) 1 - </.(^)(/.(e) V <l>{z) - m 

Thus, i?„ is a Blaschke product with zero set (j){En) composed with 0, times an 
infinite product which is boundedly convergent on any curve separating [a, h] from 
supp((o') by the admissibility conditions. In particular, {-R„} converges to zero 
locally uniformly in D. 

To describe asymptotic behavior of multipoint Fade approximants, we need 
two more concepts. Let /i be a Dini-continuous function on [a, b] . Then the geo- 
metric mean of h, given by 



Gh ■■= exp I J log h{t)du)[a^b] (t) 



is independent of the actual choice of the branch of the logarithm [13, Sec. 3.3]. 
Moreover, the Szego function of h, defined as 

S,{z) := exp 1^ 1 _ 1 |log/i(f)da;[„,,](i)| , z e D, 

does not depend on the choice of the branch either (as long as the same branch 
is taken in both integrals) and is the unique non-vanishing holomorphic function 
in D that has continuous boundary values from each side of [a, b] and satisfies 
h = GhS^S^ and Sh{co) = 1. The following theorem was proved in [13, Thm. 4] 
when r = and in [31] for the general case. 

Theorem 3. Let f be as in Theorem 1, § an admissible interpolation scheme, and 
{n„} the sequence of diagonal Pade approximants to f associated with S . Then 

(f - n„)t« = [2G^ + o{\)\{Si,RnlRf (5.3) 

locally uniformly in D^, where Rn is as in (5.2) and 

m ■■= X{{<t>{z) - <^(e))/(l - <f>{z)<f>{e)), 

the product defining R being taken over the poles ofr according to their multiplicity. 

Let now {g„} be a sequence of irreducible critical points for ^'f,n- Put qn{z) = 
ni<j<„(2: — ^j>). It follows from Proposition 2 that Lq^/qn interpolates f at every 
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1/0," with order 2, hence Lq^/qn is the n-th diagonal Pade approximants associ- 
ated with <S{q^} ■= {{l/fi,n}"=i}- This interpolation scheme of course depends on 
Qn, which accounts for the nonlinear character of the L^-best rational approxima- 
tion problem. The next theorem contains in its statement the Green equilibrium 
distribution of supp(/u) = [a,b], for the definition of which we refer the reader 
to [28]. 

Theorem 4. Let f be as in Theorem 1 and {(/„} be a sequence oj irreducible critical 
points for f. Then S'{q^} is an admissible interpolation scheme, and moreover 

E;=ilM4n)l <const. (5.4) 

where const, is independent of n. Also, the probability counting measures of the 
zeros ofqn converges to the Green equilibrium distribution on supp(/i). In addition, 
it holds that 

(f - Lqjqn)w = [2Gf, + o{l)]{Sf,Rn/Rf (5.5) 
locally uniformly in Dj, where Rn is as in (5.2) and R is as in Theorem 3. 

A few comments on Theorem 4 are in oder. First, the weak* convergence of the 
counting measures of the qn was obtained in [15, Thm. 2.1]. It entails that the 
probability counting measures of the sets -B{g„} converge weak* to the reflection of 
the Green equilibrium measure across T, which has finite energy. The admissibility 
of <^{q„} follows easily from this and from the bound (5.4) which was proven in [4], 
see [31, Lem. 8]. Then relation (5.5) is a consequence of (5.3). 

6. Proof of Theorem 1 

To prove Theorem 1, we follow the line of argument developed in [10, Thm. 1.3]. 
The main difference is that in the present case the critical points are no longer a 
priori irreducible and their poles no longer belong to the convex hull of the support 
of the measure. As we shall see, these difficulties can be resolved with the help of 
Theorem 4. 

Proof of Theorem 1. We claim there exists N = N{f) e N such that all the critical 
points of — 5'f_„ in A^„ are irreducible for n > N. Indeed, assume to the 
contrary that there exists an infinite subsequence of reducible critical point, say 
{quj}- It follows from Propositions 2 and 4 that each q^^ has a factor g*^ such 
that q*. e A4nj-k„. is an irreducible critical point of '^nj-kn '> ^^'^ difii'erence 

2 

f—Lq* I q^. vanishes at the zeros of q*. dn^ where rf„^ is a non-constant polynomial 

of degree at least [{kj + 1)/2J > 1 having all its zeros in {jzj > 1}. Suppose first 
that [nj — knj) —^ooasj—^ oo. Then, the asymptotic behavior of f — Lq- /g* . is 

2 ^ 

governed by (5.5), in particular it can only vanish at the zeros of q^^ for all large n 
which contradicts the assumption that dnj is non-constant. Second, suppose that 
rij — knj remains bounded. Up to a subsequence, we may suppose that rij — knj = I 
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for some integer I. As A^; is compact, we may assume that q*. converges to some 
q e Ml- Since i„ is a smooth function of v in some neighborhood of M.i (see 
Subsection 3.3), the polynomials L„. converge to L„ hence q*i .f — Lq- converges 
to — Lq locally uniformly in Df. In particular, if we pick < p < 1 such that Dp 
contains the zeros of q, we get that f — Lq* /q* is a normal family of functions 

converging to f — Lq/q in \z\ > p. But since the number of zeros it has in C \ D 
increases indefinitely (because it vanishes at the zeros of dn. which are at least 
[(n — / + 1)/2J in number), we conclude that f = Lq/q, which is impossible since 
f is not rational. This contradiction proves the claim. 

As wc just showed, each critical point q of in A4m is irreducible for 
all n large enough, in particular it belongs to Aln and moreover Lq/q does not 
interpolate f on T. Assume further that, for all such n, there exists a rational 
function Hq € TZn-i such that (4.15) holds with / = f. Then q is a local minimum 
by Theorem 2, and therefore it is the unique critical point of order n in view of the 
Index Theorem. Thus, to finish the proof, we need only construct some appropriate 
function 11^ for each critical point q of provided that n is large enough. 

Let ^{q„} be the interpolation scheme induced by {qn} and some admissi- 
ble interpolation scheme, with supp(^y) C {|2:| > 1}. Set {n„} to be the sequence 
of diagonal Padc approximants to f associated with S",^. Then Theorems 3 and 4 
imply that when n ^ oo 

-[l + o{l)]( ^-f''['\ y uniformly on T. (6.1) 



if -Lqjqn){z) ' ' ''\Rn{<^[q^y,z) 

Moreover, for all n large enough, (f — n„_i) is holomorphic outside of D by (5.3) 
and it has 2n—l zeros there, namely those of Iin-i{<Sv', ■)■, counting multiplicities, 
plus one at infinity. Consequently WT(f — n„_i) = 1 — 2n for all such n, and so 
(4.15) will follow from (6.1) upon constructing S^, such that, for n large enough. 



\Rn{S{^q^y,z) 



> 2 on T. (6.2) 



For convenience, let us put / := [a,b] = supp(/x) and := {x : 1/x € /}, 
together with := C \ (/ U J^^). Set p : Q ^ {1 < \z\ < A} to be the conformal 
map such that q{I) — T, g{I~^) = Ta, lim^_i,+ g{z) = 1; as is well known, the 
number A is here uniquely determined by the so-called condenser capacity of the 
pair (7, [28]. Note also that, by construction, g is conjugate-symmetric. Define 

_ l-{g{z)A)^ 
g^{z)-A^ ' 

which is a well-defined holomorphic function in O := C\ (/U/~^). It is not difiicult 
to show (c/. the proof of [10, Thm 1.3] after eq. (6.24)) that \l - K\ > 2 on T. 
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Thus, to prove our theorem, it is sufficient to find such that 



-^"(^{gn}!^) 



W -\- o{V)\hv(z) uniformly on T. 



(6.3) 



For this, we shall make use of the fact, also proven in the course of [10, Thm 1.3], 
that hv can be represented as 



hv{z) := exp 



{z) - <i){x) 

where is a signed measure of mass 2 supported on . 

Denote by the zeros of and by {xj_„}"^j their real parts. Ob- 

serve from (5.5) that any neighborhood of the poles of r which is disjoint from 
/, contains exactly m zeros of g„ for all n large enough. We enumerate these as 
The rest of the zeros of qn we order in such a manner that 

a < Xi^n < X2,n < ■ ■ < Xd„,n < b, 

while those j € {d„ + 1, . . . , n — m} for which xj^n either lies outside of (a, b) or 
else coincides with Xk,n for some fc € {1, . . . , (i„}, are numbered arbitrarily. Again 
from (5.5), any open neighborhood of I contains {Cj,n}j=i" for all n large enough, 
and therefore 



max |Im(^j-„)| 
je{i,...,d„} 



as n ^ oo. 



{6.4 



In addition, as the probability counting measures of the zeros of g„ converge to a 
measure supported on the whole interval /, namely the Green equilibrium distri- 
bution, we deduce that d„/n 1 and that 



6!y := ma,x< (xin — ci),(b— Xci„ n), max (x, « — a;,_i „) > ^ as n oo. 
t je{2,...,dn} J ' 

(6.5) 

Define u to be the image of v under the map t ^ 1/t, so that is a signed 

measure on / of mass 2. Let further (p{z) := (f>{l/z) be the conformal map of 
C\I~^ onto D, normalized so that (p{0) = and (p'{Q) > 0, Finally, set 

ip{z) - ip{t) 



K{z,t) :=log 



1 - ip{z)ip{t) 

To define an appropriate interpolation scheme S'„, we consider the coefficients: 



Xl.n + -r-I.n 



a, ■ 



jG{2,...,dn-l}, 
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Subsequently, we define two other sets of coefScients 



fe=l 

:= 2 — bin = i 



a, 



j € {l,...,dn- 1}, 



and 5o,„ = ad„,Ti '■= 0. It follows in a straightforward manner from the definitions 

that 

2 '^j,n ~ ^j — l,n "I" '^j,m j € {!> • • • ) '^tiIj 

and therefore 

(6.6) 

Next, we introduce auxiliary points yj^n by setting 



yj,n 



j e {i,...,rf„- 1}. 



Observe that 



(6.7) 



where is the total variation of j>. 

Let /C be compact in fl and U C H he a. neighborhood of I whose closure 
is disjoint from JC. By (6.4), (6.5), and (6.7) we see that both {^j^n}jZi C U and 
{yj,n}j=i^ C U for all n large enough. Thus, for such n and z € K,, we can write 
the first-order Taylor expansions: 

d 

K{Z, ij,n) - K{Z, Vj^n) = ol^i^' yj,n){^j,n - yj,n) + O ((4„ - yj,nf) , (6.8) 

d 

K{z, ^3 + l,n) - K{z, yj^n) = ^-^(^' %>)fe + l,n " %,n) + O {{^j + l,n " %,n)^) , 

(6.9) 

and adding up (6.8) multiplied by aj,„ to (6.9) multiplied by 6j,„ we obtain 

bj,nK{z, ^j+l,n) + aj^nK{z, ^j^n) - '^K{z, yj^n) = O {{^j+l,n - ^j,nf) , (6.10) 

where we took (6.7) into account and, of course, the three symbols big "Oh" used 
above indicate different functions. By the smoothness oi K on C\ I^^ x C \ /^^ 
and the compactness oi IC xU, these big "Oh" can be made uniform with respect 
to z G )C, being majorized by 

C^2\\P\\ sup _ 
{z,t)etcxu 
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In another connection, it is an immediate consequence of (6.4), (6.5), and (5.4) 
that 

j=l j=l j=l 

< (6 - a)5;* + const-fj;™ = o(l). (6.11) 

Therefore, we derive from (6.11) upon adding equations (6.10) for j e {1, . . . , d„ — 

1} that 



i=l ,7 = 1 ,7 = 1 



= 0(1), (6.12) 



where o(l) is uniform with respect to z € /C. In view of (6.6), equation (6.12) can 
be rewritten as 



0(1). (6.13) 



i=i i=i j=i 

Now, it follows from (6.5) and the definitions of cj^n and hi, that 

dn p 

'Y^Cj^nK{z,Xj^n) ^ I K{z,t)di>{t) = —log\hv{l/z)\ as n — > oo, (6.14) 
uniformly with respect to z & K,. Moreover, we deduce from (6.4) and (5.4) that 

d„ dn 

X |Ci,n(-f (^, 6>) - K{Z, Xj^n))\ < |Cj- „||Im(^j,„)| 

j=i i=i 

< C|ii>|i(5r - 0, (6.15) 

as n — > oo, where C = sup^^^ ^-jg^^j^ |9X/9t(z, f)|. Hence, combining (6.14) and 
(6.15) with (6.13), we get 

d„-l dn 

2 ^ i^(0,2/,-„)-2gif(0,4„)^log|/i,(l/0)| as n^oo, (6.16) 
uniformly on /C. Define 

2 



IW — 

which is holomorphic in O. By (6.16), it holds that log|5(„(2;)| — > log |/i^(l/2)| 
as n ^ oo uniformly on /C, and since the latter was arbitrary in Q this con- 
vergence is in fact locally uniform there. Thus, {gn} is a normal family in f2, 
and any limit point of this family is a unimodular multiple of /ii/(l/-). However, 
lim2^(,+ hi,{z) = 1 while it follows immediately from the properties of if that each 
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Qn has a well-defined limit at 1/6 which is also 1. So, {(/„} is, in fact, a locally 
uniformly convergent sequence in and its limit is h^{l/-). 

Finally, set S'„ := {-E,.,™}, whore Ei^^n = {0,™}, Ci,n l/2/j,n+i when j e 
{1, . . .,d„+i - 1}, and Ci,n l/^j+i,„+i when j e {d„+i, . . . ,n}. Then 

{Rn-iiAy: ■AlRn{^{q^y,z)f - .9„(1/Z) 

and (6.3) follows from the limit just proved that {gn\ h,^(l/-). Thus, it only 
remains to prove that S'l, is admissible. To show the first admissibility condition, 
put 

n-1 

■■= Y l'?^(C7>-l) - 'P{Cj,n-l)\- 

Then, since 

|Im(yj>)| < M(|lm(0,„)| + |Im(C,+i,„)|) , 1 < j < d„ - 1, 
by the very definition of we get 



-1 

/ , \'r\y3,n/ — Yyyjyn)] -r- Y^ 



= V |<yj(yj>)-<p(yj,„)|+ V |¥'fe+l,„)-¥'fe + l,n) 



-1 



< 2supi¥''i I ii"i(j/j>)i + Y 



u 

< 2sup|^'| [ 2||j/||E|Im(^j>)| + Y |Imfe>y 

which is uniformly bounded by (5.4). Further, since each E^^n is contained in 
U~^, we have that supp(<^y) C {\z\ > 1}. So, it only remains to show that the 
probability counting measures of E^^n converges weak* to some Borcl measure 
with finite logarithmic energy. Now, since dn/n — > 1 as n oo, and by the 
remark made in footnote 3, it is enough to prove that this property holds for 
the probability counting measures of the points {yj,n}- But from (6.7) and (6.11), 
the latter have the same asymptotic distribution as the points {£,j,n}, namely 
the Green equilibrium distribution on I by Theorem 4. This finishes the proof of 
Theorem 1. □ 
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